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SOFTWARE INSTRUCTIONS UTILISING A HARDWIRED CIRCUIT 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to a circuit arranged to perform a cyclic 
5 redundancy check (CRC) on a received data stream. The circuit may additionally 
be used to perform other forms of coding on selected data. 

Description of the Related Art 

Error coding is often added to data when there is a possibility that the 
data may be corrupted or otherwise interfered with during transmission. Simple 
10 error coding may include simply adding a parity bit which indicates to the receiver 
that the data received is not identical with the data transmitted. However, such 
simple schemes do not protect for cases where more than a single bit is at fault. 
This can pose problems when, in reality, a burst error may corrupt several bits in a 
message. 

15 Cyclic Redundancy Checking (CRC) is one form of error checking 

employed in a wide variety of data traffic systems. CRC uses a system of 

polynomial division to code a given string of data. 

For example, assume the message to be transmitted is a multi-bit 

message, M(D).D^. This message is divided, using modulo-2 arithmetic, by a so- 
20 called generator polynomial, G(D). This division yields a division polynomial, Y(D), 

and a remainder polynomial R(D). 
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The long division process (normal, or modulo-2 for polynomials) 
yields Y(D) and R(D) as two separate outputs. In practice, only the remainder, 
R(D), is of interest, and Y(D) is discarded. The message, M(D) is then normally 
appended by N zero digits, or simply shifted left N positions, where N is the order 
5 of the generator polynomial, G(D). The data that is transmitted is therefore: 
M(D).D^ + R(D). 

The data may be transmitted over any one of a number of different 
interfaces. During the course of the transmission, an error E(D) may occur. The 
error may affect one or more bits of the message, which means that the receiver 
10 will receive M(D).D'^ + R(D) + E(D). 

In order to verify the authenticity of the received data, the receiver 
divides the received data by the same generator polynomial, G(D). 

m{d).d'' ^r{d)-^e{d) ^ m{d}d'' ^ r{d) ^ e{d) 
g{d) g{d) ^ g{d)'^ g{d) 

The first term in the result of equation (2) yields Y(D) and R(D) as 
15 before. The term R(D)/G(D) yields just the remainder R(D). The term E(D)/G(D) 
yields e(D) and r(D) which represent the errors introduced by E(D). 

The result of the division by the receiver, may therefore be written as: 

e(Z)))+ R{d)-^ /?(/))+ r(Z)) (3) 

Y(D) is not significant, as before, and may simply he discarded. The 
20 same is true for e(D). which is the result of E(D) divided by G(D). In practice, Y(D) 
and e(D) are combined and cannot be separated. 

Accordingly, if no error E(D) has occurred during transmission, then 
r(D) is zero. If an error E(D) has occurred, then r(D) will have a non-zero value. In 
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such a case, the receiver may choose to ignore the received data, flag it as 
erroneous and/or request a re-transmission. 

As an example of this procedure, consider a binary stream of 192 
bits: {101 1 ...01 1}. Representing this as a polynomial already appended (or 
5 shifted) with eights *0's to take into account the order of the generator, G(D): 



1.D^^^ + 0.D^^®+ 1.D^^^+ 1.D^®^ + ... +0.D^°+ 1.D^ + 1.D® + 

O.D^ + O.D^ + O.D^ + ... + O.D^ + 0.D° (4) 

The generator polynomial, G(D), of order N=8, in this case is: 

+ + + + + D° (5) 



10 In hardware, the modulo-2 division may be implemented as shown in 

Figure 1. Figure 1 shows a circuit 20, including a string of sequentially connected 
registers 22. An XOR gate 24 is interposed between adjacent shift registers at the 
appropriate positions to realize the algorithm of equation (5). 

The data to be encoded is shifted into the coder 20 as shown in 

15 Figure 1 . After all the data bits have been shifted into the coder, a sequence of 
zero bits are shifted in. The sequence consists of as many zero bits as there are 
bits in the coder. In the present case, the sequence of zero bits is 8 bits long. 

After the sequence of zero bits has been loaded into the coder, the 
coder 20 contains the remainder R(D) of the coding division process. 

20 The same coder circuit may be used for encoding and decoding. For 

encoding, the procedure is as described above. For decoding received data, the 
message data is first shifted into the coder 20, followed by the received remainder, 
R(D). If the coder 20 stores a zero value after the remainder has been shifted in, 
this indicates that the message and remainder sequence were received without 
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error. A non-zero result indicates that either the message or remainder contains 
an error. 

The CRC function can also be implemented in software. The 
following code demonstrates a prior art realization. 

5 

for (i=0; i<data_len; i++) 
{ 

C=M[i]; //Get the next input data bit 

C= C CRC[0]; //XOR data bit with LSB of CRC register 

10 if (CRC[0]==1) 

{ 

for (j=0; j<CRC_len; j++) 
{ 

CRC[j] = CRC[j] CRC_poly[ j] ; 
15 //XOR CRC bits with generator polynomial bits 

} 

) 

//Shift CRC register bits right by one position 
for (j=0; j<CRC len - 1; j++) 

20 { " 

CRC[j] = CRC[j+l] ; 

} 

CRC[CRC_len - 1] = C//C becomes the new MSB of CRC register 

) . 

25 

From the code above, it is evident that using a normal DSP (Digital 
Signal Processor) or MCU (Microprocessor Unit), a total of 3 - 5 instruction cycles 
would be required each time the LSB of the CRC register is checked for a '1', 
performing the XOR and shift operations. The speed of this operation is further 
30 reduced when the length of the CRC register is higher than the data width of the 
processor or DSP accumulators, which are normally 8, 16, 24 or 32 bits. In such 
cases, two or more sets of XOR and shift are required. 



BRIEF SUMMARY OF THE INVENTION 

According to a first aspect of the present invention, there is provided 
35 a method of calculating a Cyclic Redundancy Check (CRC) value for a multi-bit 
input data word, comprising a defined generator polynomial, including the steps of: 
a) serially shifting at least a portion of the input data word into a 
register; 



4 



b) XORing the contents of the register with the generator 
polynomial if the LSB of the register is M'; 

c) shifting the contents of the register right by one position; 

d) shifting into the MSB position of the register a new bit of the 
input data word, having been XORed with the LSB of the 
register: 

e) repeating step d) for all message data bits; 

f) shifting into the register a number of 'O's equal to the length of 
the generator polynomial; and reading from the register the 
calculated CRC value. 

According to a second aspect of the present invention, there is 
provided an apparatus for calculating a Cyclic Redundancy Check (CRC) value for 
a multi-bit input data word, using a defined generator polynomial, including: 

• a register for serially receiving at least a portion of the input data 
word; 

• an XOR gate for XORing the contents of the register with the 
generator polynomial if the LSB of the register is '1'; 

• means for shifting the contents of the register right by one position; 

• means for repeatedly shifting into the MSB position of the register a 
new bit of the input data word, having been XORed with LSB of the 
register, until all bits of the input data word have been shifted into the 
register; and 

• means for shifting into the register a number of 'O's equal to the 
length of the generator polynomial. 

Embodiments of the invention may be used to perform CRC 
calculations on data quickly and efficiently as they may be implemented in an MCU 



such that the calculations are completed in a few clock cycles using specialized 
hardware which may be called by an appropriate instruction in software. 

The basic apparatus and circuit may further be used in a variety of 
different coding schemes including Turbo coding, PN sequence generation and 
5 convolutional coding. Such coding schemes benefit from the more speedy 
execution achieved through use of the custom hardware provided. 

BRIEF DESCRIPTION OF THE DRAWINGS 

For a better understanding of the present invention and to 
understand how the same may be brought into effect, the invention will now be 
10 described by way of example only, with reference to the appended drawings in 

which: 

Figure 1 shows a string of registers for implementing a generator 

polynomial; 

Figure 2 shows an arrangement for calculating a hamming distance; 
1 5 Figure 3 shows an arrangement for computing CRC; 

Figure 4 shows a hardwired arrangement for computing CRC; 

Figure 5 shows an arrangement for generating a PN sequence; 

Figure 6 shows an arrangement for a Turbo Encoder; and 

Figure 7 shows an arrangement for a convolutional encoder. 
20 Figure 8 is a block diagram of microprocessor unit according to one 

embodiment of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

Hamming distance is defined as the number of occurrences of bit 
differences between two variables. For a modulo-2 case, the hamming distance is 
25 defined as the number of bit positions in which the corresponding bits of two binary 
words of the same length are different. This can be calculated by XORing the 
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individual bits and counting the number of occurrences of differences. As an 
example, the hamming distance between 1011101 and 1001001 is two. 

Figure 2 shows the hardware realization of a function herein titled 
WEIGHTQ. The WEIGHT() function calculates the hamming distance between two 
5 variables input to the circuit. The circuit includes a first accumulator 58, a second 
accumulator 60, an XOR gate 62 and a cascade of adders 64. 

The result of the calculation is stored back into the first accumulator 
58. Additionally, a carry flag 66 is provided. The carry flag stores the parity of the 
result, and is set to '0* if even, or *1 ' if odd. 
10 The circuit of Figure 2 has another use. When one of the 

accumulators stores a zero value, the output written back to the first accumulator is 
equal to the number of *1's in the other accumulator. This function is sometimes 
referred to as a ^majority vote'. 

The circuit 100 shown in Figure 3 is used to compute the CRC for a 
15 message sequence provided as an input. The message sequence 102 is shifted 
bit-by-bit serially into the MSB of the shifter 121 and concatenated with the CRC 
register 1 20. The contents of the CRC register are shifted right by one position 
and the CRC register is updated after each shift operation. If the LSB of the shifter 

121 is a 'r, then the output of the shifter 121 is XORed by XOR gate 123 with the 
20 Generator Polynomial 124, else no XOR operation is perfonned. The 2-1 MUX 

122 controls whether an XOR operation is performed or not, under the control of 
the LSB ofthe shifter 121. 

This process is repeated for all the message data bits, after which a 
number of 'O's equal to the length of the generator polynomial are shifted into the 
25 CRC register. After the message sequence and sequence of 'O's have been 

shifted in, the result contained in the CRC register is the remainder ofthe division. 

The remainder is read from the CRC register in reverse bit order. 
The generator polynomial is also programmed in reverse bit order, and the MSB of 
the generator polynomial is not included. 
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Implementing a CRC function in software in the prior art requires 
several steps to perform the necessary shift, compare and XOR functions. 
Moreover, the task is repeated as many times as there are incoming data bits, 
making the process slow for large data words. 
5 Embodiments of the present invention provide hardwired instructions 

which allow software calls to be made to specialized hardware which results in the 
process being performed significantly more speedily, thus freeing up the processor 
to perform other tasks. The specialized hardware is provided as part of an MCU or 
DSP, >A^hich is thus able to offer specialized commands for performing CRC 

10 calculations. Such commands may be implemented more speedily in such a 
specialized processor than in a general purpose processor as the specialized 
hardware can perform the required calculations in fewer instruction cycles. 

For the CRC function, the hardware is implemented in such a way 
that cascading may be used to calculate the CRC for data words having a greater 

1 5 width than the accumulator of the MCU. For instance, if the MCU has a 16-bit 
accumulator, cascading may be used to allow the CRC to be calculated for a data 
word wider than 16-bits. 

The hardware function is implemented as shown in Figure 4. Four 
injDuts are required for circuit 34. The auxiliary input 150, the generator 156, the 

20 CRC register 154 and the carry flag 152. They are shown as feeding into the 
circuit of Figure 4. The CRC parity check, CRC register and generator are 16 bits 
wide in this particular embodiment. The carry bit is 1-bit wide. 

Circuit 50 extracts the LSB of the CRC register by calculating the 
parity of the auxiliary input. The auxiliary input is a copy of the CRC register that 

25 has been processed by software in the MCU by masking the CRC LSB position. 
The hardware is implemented in this way. rather than simply extracting the LSB 
from the CRC register, so that a CRC may be calculated that is not aligned to the 
word length of the CRC register, where the LSB of the CRC register may be in the 
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middle of the word. This implementation also allows hardware to be easily used in 
other coding applications outlined below. 

If the parity output from circuit 50 is determined to be odd, the new 
value of the CRC register becomes {Carry Flag, CRC register [m:!]}, and this is 
5 XORed with Generator [m:0]. If the parity output from circuit 50 is even, then the 
new value of CRC register is {Carry Flag, CRC register [m:1]}, and no XOR 
operation occurs. 

For either odd or even parity, the new value of Carry Flag is simply 
CRC register[0]. 

10 A typical implementation of the circuit shown in Figure 4 uses an m- 

bit wide dual input multiplexer 54, an m-bit input XOR gate as the parity checker 
circuit 50 to perform the parity check, and m dual input XOR gates 52 for 
performing the XOR operation. Circuit 44 shifts the bits of the CRC register 154 by 
one position and shifts in the carry flag 152 as the MSB of the CRC register. The 

1 5 outputs of Figure 4 are a new carry flag 1 58 and a new value of the CRC register 
160. 

The hardwired instruction may be called from software using a 
suitably defined command, such as CRC(&A, &Tn. &C, #value), where &A 
represents the LSB of the CRC register, &Tn represents all bits of the CRC 
20 register, &C represents the carry flag and #value represents the generator 
polynomial. 

If the Generator Polynomial, G(D), is D^® + D^^ + + + D^, this is 
coded into #value as [1 101 0000 0000 1000]. It is coded in reverse order from D° 
to D^^. the value of D^^ is assumed by the hardware. 
25 The following code describes how such a command may be used to 

calculate the CRC for a 16-bit input word, where the accumulator of the MCU is 
also 16-bit 
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step . 

1 Tn = 0 //Initialize CRC register 

2 C = incoming message bit 

3 A = Tn & 1 //Set A = LSB of CRC register by masking 

//LSB using AND operator 

4 . CRC(&A, &Tn, &C, #value) //Call the hardwired CRC instruction 

Steps 2 to 4 are then repeated for all sixteen message data bits. 
After these steps, sixteen 'O's are shifted into the CRC register, and the value 
remaining in the CRC register after this operation is the required CRC. 
5 In the case where the CRC is to be calculated for a word wider than 

the width of the accumulator, the hardwired CRC instruction is called twice. This 
scenario is illustrated in the code set out below for a 21 -bit data word. 

The CRC register is mapped to two registers 

10 CRC[21:5] Tn1[15:0] 

CRC[4:0] Tn2[15:11],whereTn2[11]istheLSB 

An arbitrary generator polynomial, G(D), may be defined as 
D^^ + 0^° + D^^ + D^^ + D'^ + D^^ + + + + + D°. This polynomial is 
1 5 coded onto #value1 and #value2 as: 

valuel: [1101 0011 0001 1000] 
value2: [0101 1000 0000 0000] 

20 As before, this is coded in reverse order from D° to D^°, with D^^ 

being assumed by the hardware embodiment. 

The code to realize the CRC calculation is: 
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step 

1 Tn1=0 

2 Tn2 = 0 

3 C = incoming data bit 

4 A = Tn2 & 0x0800 

5 CRC(&A, &Tn1,&C,#value1) 

6 CRC(&A, &Tn2, &C, #value2) 



//Initialize upper word of CRC register 
//Initialize lower word of CRC register 

//Set A = LSB of CRC register using AND 
//function. Note that LSB is located at bit 
//11 ofTn2 

//Call the hardwired CRC instruction 
//Call the hardwired CRC instruction 



As previously, the steps 3 to 6 are repeated for all bits of the input 

data word: 

The above technique can be adapted for any length of input word, by 
mapping the word onto as many 16-bit registers as are required to hold the entire 
word. Of course, if the accumulator is wider than 16-bits, it will be capable of 
processing input words of greater width. 

It is possible to define two versions of the CRC instruction to deal 
with different situations where data may need to bit-shifted to the left or the right. 
This may be useful in calculating CRC according to different standards which 
require different data structures. The only difference between their 
implementations is in the configuration of the bit-shifters which either shift to the 
left or to the right, and swap the MSB with the LSB and vice-versa. The remaining 
details are identical. 

Each implementation may be called from software using a unique 
command, such as CRCR() for the right shifting version, and CRCLQ for the left 
shifting version. 

Aside from being used to calculate CRC. the basic circuit according 
to embodiments of the invention may be used in other applications including 
Pseudo Noise (PN) generators. Turbo coding and Convolutional Coding. 
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The generation of a PN sequence may be achieved by use of the 
hardwired CRC and WEIGHT instructions preciously defined. To generate a PN 
sequence, the following steps are performed: 

(i) XORIng the PN register with an Output Generator Polynomial 
5 to produce an output; 

(ii) Calculating a feedback bit by XOR'ing the PN register with a 
Feedback Generator Polynomial; 

(iii) Shifting the contents of the PN register right by one position; 
and 

10 (iv) Shifting the feedback bit into the MSB position of the PN 

register. 



Figure 5 shows how step (ii) above is performed. Figure 5 shows a 
string of sequentially connected register 160. A feedback signal 170 is created by 
1 5 combining selected register outputs to feedback to the MSB register. The output 
172 is created by XORing selected register outputs. The register outputs used to 
create the feedback and output signals are selected on the basis of a defined 
generator polynomial. 

The code below demonstrates how the hardwired instructions CRC() 
20 and WEIGHT() are used in the production of a PN sequence. 



To Is used for the PN shift register The MSB of the register is aligned 

to To[1 5] and the LSB is aligned to To[6]. 
Y Temporary PN register used for computing the output and feedback 

25 values. 

Ti Generator polynomial taps for the PN generator output. The MSB of 

the taps is aligned to Ti[1 5] and the LSB is aligned to Ti[6]. 
Ti Generator polynomial taps for the feedback. The MSB of the taps is 

aligned to T2[1 5] and the LSB is aligned to T2[6]. 
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Xn Pointer to PN generator output array. 

C Carry or parity bit output of Weight function. 

#value Is always set to 0x8000 as only the incoming bit has to be XOR'ed. 

Z Is used as input to CRC() function and is always set to 0. 

The following code is written in terms of the variables defined above: 



Step 

1 

2 

3 
4 



6 
7 
8 
9 

10 

11 
12 
13 



To = 0 
Z = 0 

#value = 0x8000 
Ti = 0x3100 

72 = 0x0240 

Y = To 

Y &= OXFFCO 
WEIGHT (Y.Ti) 
*Xn++=C 

Y = To; 

Y &= OxFFCO 

WEIGHT (Y.Tz) 
CRCR(Z.To.C,#value) 



//Initialize PN register 

//initialize input for CRC(), always set to 

zero as no XOR required 

//Initialize input for CRC() 

//Initialize taps for output as shown in 

Fig. 5 

//Initialize taps for feedback as shown in 
Fig. 5 

in 0 calculate the output - STEP #1 
//Masking unused bits of register 
//Store parity in Y for PN output 
//Output is equal to the Carry or Parity bit 
of Weight function 

//For shifting new value into To[15] - STEP 
#2 

//Masking unused bits of register 
//Store parity of feedback bits in C 
//Shift a 1 or 0 into MSB of To depending 
value of C 



Steps 6 to 13 are repeated for the required PN sequence length. 
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. In examples of the invention where the PN register length (N) is 
greater than 16, two or more 16-bit registers are used to represent the PN register 
and two or more 16-bit registers are used represent the generator polynomial, in a 
similar way as described previously for calculating the CRC for a data word of 
5 greater width than the accumulator. In such a case the WEIGHTQ and CRCRQ 
instructions are called more than once as required. 

The WEIGHTQ and CRCQ commands can be used to implement a 
Turbo encoder. The hardware implementation of such an encoder is shown in 
Figure 6. The circuit 180 is arranged to receive successive input bits 182, and to 
10 produce a stream of output bits 184. The input is fed into the first of a string of 
sequentially connected registers 190, and the output is derived from the last in the 
same string of registers. Various feedback signals are combined using XOR gates 
to encode the input data stream. 

The principle behind Turbo encoding is to receive an input bit 182, 
1 5 calculate a feedback bit 1 86, shift the contents of the Turbo register and then 
calculate the output bit 184. This process is repeated as new data input bits are 
shifted into the circuit. 

The code below demonstrates how such a function may be realized 
using the CRCQ and WEIGHT() commands described previously: 

20 

To Is used for the Turbo shift register. The MSB of the register iis 

aligned to To[1 5] and the LSB is aligned to To[1 2]. 
Y Temporary Turbo register used for computing the output and 

feedback parity values. 
25 Z Is used to store the parity of feedback bits. 

T1 Generator polynomial taps for the feedback. The MSB of the taps is 

aligned to Ti[15] and the LSB is aligned to Ti[12]. 
T2 Generator polynomial taps for the Turbo encoder output. The MSB 

of the taps is aligned to T2[1 5] and the LSB is aligned to T2[1 2], 
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Xn 
C 

#value 



Pointer to Turbo encoder output array. 

Carry or parity bit output of Weight function. 

Is always set to 0x8000 as only the incoming bit has to be XORed. 



The following steps are executed: 



Step 

1 

2 



4 

5 
6 
7 
8 

9 

10 

11 
12 
13 
14 



To 
Ti 



0 

0x3000 



Tz = OxDOOO 

#value = 0x8000 

Y = To 

Y &= OxFOOO 
WEIGHT (Y.Ti) 
Z = C 

C = Incoming bit 
CRCR(Z,To,C,#value) 

Y = TO 

Y &= OxFOOO 
WEIGHT (Y.Tz) 
*Xn++= C 



//Initialize Turbo register 

//Initialize taps for feedback as shown in 

Fig. 6 

//Initialize taps for output as shown in 
Fig. 6 

//Initialize #value for CRCQ function 
//To calculate the output - STEP #1 
//Mask the unused bits of the register 
//Store parity of feedback bits in C 
//Z equals to C and becomes input for 
CRC() function 
//Get next input bit 

//Shift To right by one position and shift in 
new MSB 

//T o calculate the output bit - STEP #2 
//Mask the unused bits of the register 
//Store parity in C 

lie is the output of the Turbo encoder 



Steps 5 to 14 are repeated for all the data bits to be encoded. 

A convolutional encoder can also be implemented using the CRCQ 
and WEIGHTQ commands. Figure 7 shows a representation of a convolutional 
encoder (half-rate, constraint length nine) 200. It includes a series of sequentially 
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connected registers 210. The first register in the string, is arranged to receive the 
input data bits 202. The two output data streams 204, 206 are created by XORIng 
certain register outputs as defined by the generator polynomial. New data bits 182 
are fed into the series of registers 210 such that each new bit forms the MSB of the 
5 word defined by the registers. 

In the embodiment presented here, the WEIGHTQ command is Used 
to calculate the output bits and the CRC() command is used to shift the register 
bits right by one position. 

10 To Is used for the convolutional shift register. The MSB of the register is 

aligned to To[1 5] and the LSB is aligned to To[8]. 
A Is set to zero as the XOR operation is not required because the 

CRC() instruction is used only for right shift for implementing the 

convolutional encoder, 
1 5 #value Is of no consequence since A is set to zero and no XOR function is to 

be performed. 

Y Temporary convolutional register used for computing the output 

values. 

Ti Generator polynomial taps for the first encoder output. The MSB of 

20 the taps is aligned to Ti[1 5] and Ti is set to 0x3880. 

T2 Generator polynomial taps for the second encoder output. The MSB 

of the taps is aligned to T2[1 5] and T2 is set to 0x7580. 
C Carry or parity bit output of Weight function. 

Xn Pointer to convolutional encoder output array. 

25 

The following steps are executed: 
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step 

1 To = 0 

2 C = incoming message bit 

3 A=0 

4 CRCR(A,To.C,#value) 

5 Y=To 

6 Y &= OxFFSO 

7 WEIGHT(Y.Ti) 



8 



*Xn++= C 



9 Y=To 

10 Y&= OxFFSO 

1 1 WEIGHT(Y,T2) 

12 *Xn++= C 



//Initialize Convolutional register 

//XOR function not required 

//Shift C into MSB position of To. No XOR 

to be performed. 

//Constraint length 9, masking not required 
bits 

//Store Hamming distance in Y for first 
output 

//Output is equal to the Carry or Parity bit 
of WEIGHTQ function 

//Constraint length 9, masking not required 
bits 

//Store Hamming distance in Y for second 
output 

//Output is equal to the Carry or Parity bit 
of Weight function 



Steps 2 to 12 are repeated for all the data bits to be encoded. 

Thus, it can be seen that by the provision of specialized hardwired 
instructions in an MCU, simple software routines may be developed which allow 
CRC calculations to be easily and quickly performed. 

The basic hardware required to perform CRC calculations, which 
may be called in software, may further be used as a building block to perform the 
other forms of complex coding described above. Indeed, other coding schemes 
not herein described may be produced using the same techniques. 
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Shown in Figure 8 is a block diagram of a microprocessor unit 300 
that implements an embodiment of the present invention. The microprocessor unit 
300 includes an instruction storage unit 305 that stores software instructions to be 
executed. The software instruction may include both basic instructions and 
5 parameters acted on by the instructions. For example, the CRC instruction 
discussed above may include the &A. &Tn, &C, and #value parameters. The 
instruction storage unit 305 may be a buffer or other memory that stores all or part 
of the instructions of a computer program. 

The microprocessor unit 300 also includes a decoder 310, a CRC 

10 hardwired circuit 315, a Weight hardwired circuit 320, and an execution unit 325 
interconnected by a bus 330. The decoder 310 receives an instruction from the 
instruction storage unit 305, determines whether the instruction is a standard 
instruction or one of the CRC and Weight instructions discussed above. If the 
received instruction is a standard instruction, the decoder 310 passes the 

1 5 instruction on the bus 330 to the execution unit 325 which executes the instruction 
normally. If the received instruction is the CRC instruction, the decoder 310 
passes the instruction to the CRC hardwired circuit 315, which implements the 
CRC instruction in hardware as discussed above with respect to Figures 3-4. If the 
received instruction is the Weight instruction, the decoder 310 passes the 

20 instruction to the Weight hardwired circuit 320, which implements the Weight 
instruction in hardware as discussed above with respect to Figure 2. 

All of the above U.S. patents, U.S. patent application publications, 
U.S. patent applications, foreign patents, foreign patent applications and non- 
patent publications referred to in this specification and/or listed in the Application 

25 Data Sheet are incorporated herein by reference, in their entirety. 

The present invention Includes any novel feature or combination of 
features disclosed herein either explicitly or any generalization thereof irrespective 
of whether or not it relates to the claimed invention or mitigates any or all of the 
problems addressed. 
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